403 research outputs found
Federated PAC-Bayesian Learning on Non-IID data
Existing research has either adapted the Probably Approximately Correct (PAC)
Bayesian framework for federated learning (FL) or used information-theoretic
PAC-Bayesian bounds while introducing their theorems, but few considering the
non-IID challenges in FL. Our work presents the first non-vacuous federated
PAC-Bayesian bound tailored for non-IID local data. This bound assumes unique
prior knowledge for each client and variable aggregation weights. We also
introduce an objective function and an innovative Gibbs-based algorithm for the
optimization of the derived bound. The results are validated on real-world
datasets
Growth of nonsymmetric operads
The paper concerns the Gelfand-Kirillov dimension and the generating series
of nonsymmetric operads. An analogue of Bergman's gap theorem is proved,
namely, no finitely generated locally finite nonsymmetric operad has
Gelfand-Kirillov dimension strictly between and . For every or , we construct a single-element
generated nonsymmetric operad with Gelfand-Kirillov dimension . We also
provide counterexamples to two expectations of Khoroshkin and Piontkovski about
the generating series of operads.Comment: 32 pages, 9 figure
ChatCAD: Interactive Computer-Aided Diagnosis on Medical Image using Large Language Models
Large language models (LLMs) have recently demonstrated their potential in
clinical applications, providing valuable medical knowledge and advice. For
example, a large dialog LLM like ChatGPT has successfully passed part of the US
medical licensing exam. However, LLMs currently have difficulty processing
images, making it challenging to interpret information from medical images,
which are rich in information that supports clinical decisions. On the other
hand, computer-aided diagnosis (CAD) networks for medical images have seen
significant success in the medical field by using advanced deep-learning
algorithms to support clinical decision-making. This paper presents a method
for integrating LLMs into medical-image CAD networks. The proposed framework
uses LLMs to enhance the output of multiple CAD networks, such as diagnosis
networks, lesion segmentation networks, and report generation networks, by
summarizing and reorganizing the information presented in natural language text
format. The goal is to merge the strengths of LLMs' medical domain knowledge
and logical reasoning with the vision understanding capability of existing
medical-image CAD models to create a more user-friendly and understandable
system for patients compared to conventional CAD systems. In the future, LLM's
medical knowledge can be also used to improve the performance of vision-based
medical-image CAD models
Recordism: A social-scientific prospect of blockchain from social, legal, financial, and technological perspectives
Blockchain has the potential to reform the architecture of cyberspace and
transform the storage, circulation and exchange of information through
decentralization, transparency and de-identification. Meaning that ordinary
participants can become traders, miners, retailers, and customers
simultaneously, breaking the barriers and reducing the information gap between
participants in the community, contributing to the futuristic metaverse with an
open progressive and equal ideology. Such information transformation empowered
by blockchain also profoundly impacts our methodological cognition, legal
governance on cyberspace and financial and technological development.
This study explores the main question: what are the implications of the
blockchain-driven information revolution for society and social sciences? In
order to answer this main question, this paper chooses four perspectives, which
are methodological, legal, financial and technical. By analysis of these four
perspectives, this paper is expected to provide a more comprehensive analysis
of the blockchain-driven impact on society, social sciences, and technology to
contribute to current scholarships. Additionally, regarding blockchain as an
innovative methodological cognition, it grows on top of other technologies
while helping advance other technologies. This paper concludes that although
there are few frictions between blockchain and current social architecture,
blockchain is so much more than the technology itself, that can be a
representative of the community, acting as the source of trust, watcher of
governance, law enforcer for virtual activities, and an incubator for future
technologies
Neural Point Process for Learning Spatiotemporal Event Dynamics
Learning the dynamics of spatiotemporal events is a fundamental problem.
Neural point processes enhance the expressivity of point process models with
deep neural networks. However, most existing methods only consider temporal
dynamics without spatial modeling. We propose Deep Spatiotemporal Point Process
(\ours{}), a deep dynamics model that integrates spatiotemporal point
processes. Our method is flexible, efficient, and can accurately forecast
irregularly sampled events over space and time. The key construction of our
approach is the nonparametric space-time intensity function, governed by a
latent process. The intensity function enjoys closed form integration for the
density. The latent process captures the uncertainty of the event sequence. We
use amortized variational inference to infer the latent process with deep
networks. Using synthetic datasets, we validate our model can accurately learn
the true intensity function. On real-world benchmark datasets, our model
demonstrates superior performance over state-of-the-art baselines. Our code and
data can be found at the https://github.com/Rose-STL-Lab/DeepSTPP
ResiDualGAN: Resize-Residual DualGAN for Cross-Domain Remote Sensing Images Semantic Segmentation
The performance of a semantic segmentation model for remote sensing (RS)
images pretrained on an annotated dataset would greatly decrease when testing
on another unannotated dataset because of the domain gap. Adversarial
generative methods, e.g., DualGAN, are utilized for unpaired image-to-image
translation to minimize the pixel-level domain gap, which is one of the common
approaches for unsupervised domain adaptation (UDA). However, the existing
image translation methods are facing two problems when performing RS images
translation: 1) ignoring the scale discrepancy between two RS datasets which
greatly affects the accuracy performance of scale-invariant objects, 2)
ignoring the characteristic of real-to-real translation of RS images which
brings an unstable factor for the training of the models. In this paper,
ResiDualGAN is proposed for RS images translation, where an in-network resizer
module is used for addressing the scale discrepancy of RS datasets, and a
residual connection is used for strengthening the stability of real-to-real
images translation and improving the performance in cross-domain semantic
segmentation tasks. Combined with an output space adaptation method, the
proposed method greatly improves the accuracy performance on common benchmarks,
which demonstrates the superiority and reliability of ResiDuanGAN. At the end
of the paper, a thorough discussion is also conducted to give a reasonable
explanation for the improvement of ResiDualGAN. Our source code is available at
https://github.com/miemieyanga/ResiDualGAN-DRDG
- …